Context-sensitive learning methods for text categorization
نویسندگان
چکیده
منابع مشابه
Context-sensitive Learning Methods for Text Categorization
Two recently implemented machine learning algorithms, RIPPER and sleeping experts , are evaluated on a number of large text categorization problems. These algorithms both construct classiiers that allow the \context" of a word w to aaect how (or even whether) the presence or absence of w will contribute to a classiication. However , RIPPER and sleeping experts diier radically in many other resp...
متن کاملLearning Context for Text Categorization
This paper describes our work which is based on discovering context for text document categorization. The document categorization approach is derived from a combination of a learning paradigm known as relation extraction and an technique known as context discovery. We demonstrate the effectiveness of our categorization approach using reuters 21578 dataset and synthetic real world data from spor...
متن کاملLearning-Free Text Categorization
In this paper, we report on the fusion of simple retrieval strategies with thesaural resources in order to perform large-scale text categorization tasks. Unlike most related systems, which rely on training data in order to infer text-to-concept relationships, our approach can be applied with any controlled vocabulary and does not use any training data. The first classification module uses a tra...
متن کاملLarge Margin Winnow Methods for Text Categorization
ABSTRACT The SNoW (Sparse Network of Winnows) ar hite ture has re ently been su essful applied to a number of natural language pro essing (NLP) problems. In this paper, we propose large margin versions of the Winnow algorithms, whi h we argue an potentially enhan e the performan e of basi Winnows (and hen e the SNoW ar hite ture). We demonstrate that the resulting methods a hieve performan e om...
متن کاملImproving Methods for Single-label Text Categorization
As the volume of information in digital form increases, the use of Text Categorization techniques aimed at finding relevant information becomes more necessary. To improve the quality of the classification, I propose the combination of different classification methods. The results show that k-NN-LSI, the combination of k-NNwith LSI, presents an average Accuracy on the five datasets that is highe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Information Systems
سال: 1999
ISSN: 1046-8188,1558-2868
DOI: 10.1145/306686.306688